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Echo canceller having improved model mismatch compensation 



The present invention relates to an interference canceller comprising an 
adaptive filter for modeling an actual interference, and a spectral processor for processing the 
modeled interference together with near end speech and the actual interference. 

The present invention also relates to a system, in particular a communication 
5 system, for example a bands-free communication device, such as a mobile telephone, a 

speech recognition system or a voice controlled system, which system is provided with such 
an interference canceller, to a method for canceling an interference, such as for example echo 
and/or noise, and to signals suited for use in the interference canceller. 

io : ,, 

Such an interference canceller, system and method are known from 
W097/45995 (=EP-A-0843934). The known interference canceller has a fer end input for an 
other communicating party, a near end output for a loudspeaker, a near end input for ft local 
audio microphone, andafkr end output to the other party. The interference canceller 

15 comprises an adaptive filter coupled to the loudspeaker and to the microphone, and a spectral 
residual interference processor coupled to the adaptive filter and to the microphone, The 
adaptive filter models an actual interference, such as an echo between the loudspeaker and 
the microphone in order to compensate the actual echo. The spectral processor then acts as a 
dynamic echo post processor for suppressing a residual echo or echo tail part not 

20 compensated for by the adaptive filter. At the start of a communication session between the 
fer end party and the local party the respective adaptive filters are not capable of modeling 
the echo patib between loudspeaker and microphone due to a lack of a sufficient far end input 
signal This means that initially also the spectral processor receives no or only inadequate 
information about modeled echoes. So initially there is no interference cancellation at all and 

25 in this phase of the communication session echoes are not compensated sufficiently accurate. 
Only after some exchange of speech between the parties will the adaptive filters have been 
converged to a state of steady interference compensation, followed by a steady operation of 
the spectral interference post processor. 
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Therefore it Is an object of the present invention to provide an interference 
canceller, whose capabilities of fast and accurately adjusting to changing communication 
conditions such as arising at the start of a communication session are improved. 
5 Thereto the interference canceller according to the invention is characterized 

in that the interference canceller further comprises an interference model mismatch 
compensator coupled to the adaptive filter for providing a mismatch signal for the spectral 
processor, said mismatch signal showing a speech independent decay* 

Similarly the method according to the invention is characterized in that an 
10 interference model mismatch signal is used for modeling the interference, which mismatch 
signal shows a speech independent decay. 

The inventors found that although far end speech is required for the 
interference cancellers for starting up interference model, in particular echo and/or noise 
building by the adaptive filters, it is this same wanted speech -that is to be sent to the far aid 
15 party- that is poorly echo compensated by the echo cancellers at the initial stage of the 
communication session, In addition the presence of near end speech at the initial stage 
precludes a fast convergence of the acquisition process performed by the adaptive filters and 
the spectral processors. Therefore an interference mismatch compensator is proposed, whose 
decaying interference compensation features do not depend on the wanted speech. 
20 Advantageously this results in a faster convergence of the interference modeling process, 
which is particularly important at the start up of the communication or after recovery thereof. 
Furthermore it safeguards a faster tracking and interference suppression in case of changing 
communication conditions, such as may arise when loudspeaker volumes, or interference 
properties in the room change. It is a further advantage that after such changes a more 
25 accurate interference canceling will be reached earlier in time* 

Finally it is important to note that the solution presented here does not require 
application of a speech detector, which leads to a less critical, simplified and cost effective 
operation of the interference canceller according to the invention. 

An embodiment of the interference canceller according to the present 
30 invention has the characterizing features outlined in claim 2, 

The fast and accurately established interference modeling can advantageously 
be used by the step size estimator for quickly and reliably optimizing step size control for the 
adaptive filter. 
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Another embodiment of the interference canceller according to the invention 
has the characterizing features of claim 3. 

Advantageously the ratio of a spectral measure of the near end speech and 
interference, and the modeled echo of the adaptive filter can be used for implementing the 
5 speech independent mismatch signal 

Speech independence can be acquired by making use of a pause in speech, 
such that the minimum of said ratio is determined over a time span, wherein the near end 
signal only comprises interference, in particular echo and/or noise. Such atime span 
preferably lasts at least 4 to 5 seconds. 
10 Jn general the mentioned spectral measure is defined by some positive 

function of the spectral power concerned, such as the spectral magnitude, the squared spectral 
magnitude, the power spectral density or the Mel-scale spectral density. 

At present the interference canceller, system and method according to the 
invention will be elucidated further together with their additional advantages, while reference . 
IS is being mad e to the appended drawing, wherein similar components are b eing referred toby 
means of the same reference numerals. 



In the drawing: 

20 Fig. 1 shows a general outline of an interference canceller according to the 

prior art; 

Fig. 2 shows an embodiment of a spectral processor for application in the 

interference canceller according to the invention; 

Fig, 3 shows a block diagram of a detailed interference model mismatch 
25 compensator for application in the interference canceller according to the invention; 

Fig. 4 show a graphical representation of the operation of the interference 
model mismatch compensator of Fig. 3, in the form of an echo model mismatch compensator, 
against time in the initial phase, where the adaptive filter model starts from all zero 
coefficients. 

30 Fig. 5 shows an embodiment of the interference canceller according to the 

invention in the form of a noise canceller, wherein the loudspeaker in Fig. 1 has been 
replaced by a reference microphone; and 

Fig- 6 shows an embodiment like Fig. 5 having a beam former. 
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Fig. I shows a shows a general outline of an interference canceller 1 at first to 
be described while embodied as an Acoustic Echo Canceller (ABC) L Such an AEC 1 is an 
important component in nowadays mostly fiiU duplex communication systems* such as for 
5 eacample a speaksrphone device, teleconferencing device* a telephone device, in particular a 
mobile telephone, a hands-free telephone or the Hke. In modern handsets, where a 
loudspeaker 2 and a microphone 3 are coupled to the AEC 1 and generally are mounted very 
close together, such an AEC removes annoying local echoes. The same applies for a 
teleconference device where mostly one or more loudspeakers and microphones are coupled 
10 to the AEC 1. 

Fig. 1 shows a signal x(Tc] coining from a far end, which signal is reproduced 
by the loudspeaker 2 at the near end side. The index k indicates that the signal x is sampled* 
Apart from speech spc] mainly originating from a near end speaker the microphone 3 also 
senses a signal ypc] comprising a reverberated far end echo generated through an echo path 

15 from the loudspeaker 2 to the microphone 3. So for a micro phone signal z[k] at the near end 
it holds that z[k] = s[k] + y[k] (if noise n[k] is neglected). The AEC 1 operates by means of 
an adaptive filter 4 to generate an echo estimate signal f\k] 9 which if subtracted from z[k] in 
adder 5 reveals a signal r[k]> which ideally does not contain the echo signal y[k] . Ideally the 
signal r[k], which may be the output signal of the AEC 1, only comprises the wanted local 

20 near end signal s[k], hereafter called the speech signal. Thereto the adaptive filter 4 models 
the. echo path represented by the echo estimate signal $[k]. It is noted that two AECs are 
required at the far end and at the near end respectively in a communication device or 
communication network. 

The AECs 1 operation may be extended by including a residual echo processor 

25 6 therein, bx that case the signal r'Pc] is the output signal of the AEC 1. In practice the 

adaptive filter 4 is not always able to accurately model the transfer function of the acoustic 
path between the loudspeaker 2 and the microphone 3 due to its finite digital filter length, 
tracking problems and non linear effects. Processor 6 being a post processor has the 
important advantage that it provides sufficient echo suppression and robustness at all times. 

30 The output signal of the echo post processor 6 indicated r' [k] is coupled to the far end. The 
operation of the post processor 6 is considered to be well known, but can for example be 
taken from EP-A-0 843 934, whose disclosure is supposed to be include herein by reference 
thereto. Principally the AEC 1 may be of an arbitrary adaptive filter type. Examples of 
suitable algorithms for adjusting coefficients of the echo canceller are; the Least Mean 
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Square (LMS) or Normalized IMS algorithm, or the Recursive Least Square (RLS) 
algorithm. 

At the start of a communication session between the far end speaker and the 
near end speaker the adaptive filter 4 and thereafter the spectral processor 6 start to converge 
5 to a model of the acoustic impulse response between the loudspeaker 2 and the microphone 3 . 
Depending on the signal type of the far end speaker, the length of the adaptive filter 4 and the 
Step size used in the algorithm it will take some time for the adaptive filter 4 to converge, 
usually a couple of seconds. During this time the echo suppression -if at all present- is poor 
resulting in an unpleasant start of the communication between the parties. The general 

1 0 problem is to obtain an accurate spectrum estimate of the echo present in the signal 

originating from the microphone 3 as quickly as possible. Only thereafter residual echoes can 
be suppressed by the echo suppressor 8, followed by a control of the step size in order to 
optimize the step size. These problems are difficult to solve if it would be necessary to make 
' use of a complex and critical speech detector. 

15 Thereto use is made of an echo model mismatch compensator 7, inclu ded in 

the overall scheme of the echo canceller of Fig. 2. The compensator 7 is detailed in Fig. 3. 
Fig. 2 shows respective signal analysis blocks A performing a spectral analysis and 
conversion on each of the above mentioned signals rjk] 3 z[k]> and y[k]. The conversions 
result in amplitude and phase representations of these signals, schematically indicated p and 

20 9 respectively. Only the phase <?(R) of processor input signal r[k] is used, together with a 
modified power spectrum R'madQO for reconstruction of the output signal r'Pc] by a synthesis 
block S. The modification ofR' mo d(k) will be explained later. Long after the start of a 
communication session, that is in case of a steady state, the adaptive filter 4 has already 
converged and R'mod(k) then represents unmodified former spectral values R, wherein 

25 residual echoes are being suppressed by residual spectral echo suppressor 8, while use is 
made of ^<k)„ Because in that case the output 1ta>a(k) of the echo model mismatch 
compensator 7 equals its unmodified input ^(k), which represents the power spectral part of 
the estimated output signal $\k] of the converged adaptive filter 4. From Fig. 3 it can be seen 
that the frequency dependent model mismatch estimate G defined by l^gdl" G |Y|, equals 1 

30 for all frequency bins, long after the start of the communication session. It is now proposed to 
calculate the estimate G for each frequency bin j, in case if[(k-i)B]j > 0, in accordance with; 

G(kB]j - minfc {<u W} {|Z[^i)B]j|ylt[(k^)Blj|> (1) 
where Zj and % represent the spectral amplitude in frequency bin j of the microphone signal 
z[k] and the adaptive filter output signal £[k] respectively, and where *min' means that the 
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minimum of the absolute value ratio between accolades is tracked over a time span covering 
a number of L time frames out of a total of k blocks having a block size B. If it holds that 
$X(k-i)Blj = 0 than the absolute value ratio in equation (1) above is set to infinity. The effect 
of applying the m^lmum tracking operation in equation (1) is that a presence of local near 
5 end speech (shown dotted in Pig- 4) will not lead to an increase of G[kB]j and thus to an 
unwanted upward bias of the echo model mismatch estimate. So at the start of the 
commuaication session speech will not adversely influence the building up of the echo 
model, because use is made of equation (1) generating a mismatch signal represented by 
r^madl, which signal shows a decay which is speech independent 
10 This behavior is graphically displayed in Pig. 4, which shows the schematic 

decay of the estimate G as a function of time. Herein the not increasing -flat- parts of G 
represent periods of speech, whose adverse effects on the model of the echo estimate are 
flattened out This leads to undistorted speech. 

hi situations without any excitation or speech in any frequency bin j for a 
15 period longer that L frames the model mismatch is set to infinity. Since the amount of echo 
suppression by the spectral post processor 6 is zero in case of zero excitation, there will be no 
distortion of the wanted signaL 

The time span covered by equation (1) preferably contains at least one pause 
in the speech. In practice the time span will last at least 4 to 5 seconds. The echo model 
20 mismatch compensator 7 may comprise well known shift registers, which may store 

consecutive calculated .values of the ratios numerator ^denominator. 

Fig. 2 also shows that the echo canceller 1 comprises a step size estimator 8 in 
particular coupled to the echo model mismatch compensator 7. This has the additional effect 
that -also- during start up of the communication session the step size used in the algorithm 
25 can be optimized at an earlier stage in the communication- This early optimization is 
independent from the applied step size control or from the way the step size estimator 8 
operates . As far as the estimator 8 for made use of 1f for every frequency bin, this quantity 
may simply be replaced by the associated ^mod specified above, in order to reveal the 
advantageous results during the start up of the communication session. Without going into 
30 further details the step size can be optimized in terms of^mod for effecting a full band 

optimal step size, both during startup and consequent steady state. Furthermore the step size 
control can be realized in a frequency dependent manner. 

Equivalent to the echo model mismatch estimate described above a noise 
cancellation term can similarly be proposed in order to introduce an associated noise model 
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mismatch estimate, either or not combined with, the echo model mismatch estimate. Figs. 5 
and 6 shows respective embodiments of the interference canceller 1 now to he described, 
■which ate embodied as anoise canceller 1. Also referring to Kg. 1 it can be seen that the 
overall view is quite similar to the build ups of Figs . 5 and 6, except that the loudspeaker 2 
5 has been replaced by a reference signal microphone 9. The microphone 9 senses the reference 
signal, now the noise signal, which noise signal together with the speech is sensed by the 
microphone 3. The adaptive filter 4 models the noise path between microphones 9 and 3, The 
signal z\k] now contains the speech s[k] and noise nftcj. The noise estimate n[k] modeled by 
me adaptive filter 4 may be processed in a similar way as y[k] in the post processor 
10 embodiment of Fig. 2. So finally echo and/or noise are treated similarly. 

Fig. 6 shows an embodiment of the interference canceller 1 in the form of a 
noise canceller, wherein the loudspeaker in Fig. 1 has been replaced by a reference 
microphone 9, which apart from noise n[k] also senses part of the speech s|Tc]. Like in Fig. 5 
microphone 3 senses speech and noise. A beam former 10 is included in the noise canceller 1 
15 for sep arating noise now included in a noise sign al n'pc] and a si gnal z'jk] comprising speech ; 
and noise. The operation of the beam former is known from WO 99/27522, whose disclosure 
is included here by reference thereto. The noise estimate fi*|k] is treated similar to the noise 
estimate fi[k] of Fig. 5. 

With the interference model mismatch estimate of the interference canceller 1, 
20 the echo and/or noise power spectrum can be estimated very accurately leading to a 

significant improvement of the acoustic canceller 1 in case the adaptive filter 4 is in an earlier 
state of convergence. Especially in the initial convergence phase a high quality operation of 
the acoustic canceller is important as this determines the first impression of the user. 

Whilst the above has been described with reference to essentially preferred 
25 embodiments and best possible modes h will be understood tiiat these embodiments are by no 
means to be construed as limiting examples of the systems and method concerned, because 
various modifications, features and combination of features falling within the scope of the 
appended claims are now within reach of the skilled person. 
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CLAIMS; 



1 . Interference canceller comprising an adaptive filter for modeling an 

interference, and a spectral processor for processing the modeled interference together with 
near end speech and the actual interference, characterized in that the interference canceller 
further comprises an interference model mismatch compensator coupled to the adaptive filter 
5 for providing a mismatch signal for the spectral processor, said mismatch signal showing a 
speech independent decay. 

X Interference canceller according to claim 2, characterized in that the 

interference canceller comprises a step size estimator coupled to the interference model 

10 



3. Interference canceller according to claim 1 or 2» characterized in that the 

interference model mismatch compensator is arranged for calculating an interference model 
mismatch estimate based on a minimum of the ratio of a spectral measure of the near end 
1 5 speech and actual interference, and the modeled interference of the adaptive filter. 

interference 
of said ratio is determined over a time span. 

20 5. Interference canceller according to claim 4 3 characterized in that the time span 

contains at least one pause in the speech. 

6. Interference canceller according to claim 4 or 5, characterized in that the lime 

span lasts at least 4 to 5 seconds. 



25 



7. IntBrfeorence canceller according to one of the claims 3-6, characterized in that 

the spectral measure is defined by some positive fimction of the spectral power concerned, 
such as the spectral magnitude, the squared spectral magnitude, the power spectral density or 
the Mel-scale spectral density. 
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8, Interference canceller according to one of the claims 1-7, characterized in that 

the interference canceller is embodied as an echo canceller and/or a noise canceller. 



10 



9 # System, in particular a communication system, for example a hands-firee 

commiimcation device, such as a mobile telephone, a speech recognition system or a voice 
controlled system, which system is provided with an interference canceller according to one 
of the claims 1-8, the interference canceller comprising an adaptive filter for modeling an 
actual interference, and a spectral processor for processing the modeled interference together 
with near end speech and the actual interference, characterized in that the interference 
canceller further comprises an interference model mismatch compensator coupled to the 
adaptive filter for providing a mismatch signal for the spectral processor, said mismatch 
signal showing a speech independent decay. 



IS 10. 



Method for cancelling an interference, whereby an actual inter ference is 



modeled and the modeled interference, together with near end speech and the actual 
interference are processed, chaiacterized in that an interference model mismatch signal is 
used for modeling the actual interference, which mismatch signal shows a speech 



20 

1 1 , Signals suited for use in the interference canceller according to one of the 

claims 1-8, the interference canceller comprising an adaptive filter for modeling an actual 
interference, and a spectral processor for processing the modeled interference together with 
near end speech and the actual interference, characterized in that the interference canceller 
25 further comprises an interference model mismatch compensator coupled to the adaptive filter 
for providing a mismatch signal to the spectral processor, said mismatch signal showing a 



016 16.07.2002 10 
10 12.07.2002 

ABSTRACT: 



An interference canceller is described, comprising an adaptive filter for 
modeling an interference, such as an echo or noise., and a spectral processor for processing 
the modeled interference together with near end speech and the interference. The interference 
canceller further comprises an interference model mismatch compensator coupled to the 
5 adaptive filter for providing a mismatch signal to the spectral processor, said mismatch signal 
showing a speech independent decay. With the model mismatch signal of the interference 
canceller fee interference power spectrum can be estimated very accurately leading to a 
significant convergence improvement of the acoustic canceller. Especially in the initial 
convergence phase at the start of a communication session a high quality operation of the 
1 0 acoustic canceller is achieved, which is important as this determines the first quality 
impression of the uses, 
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